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Abstract 

Music composition used to be a pen and paper activity. These these days music is of- 
ten composed with the aid of computer software, even to the point where the computer 
compose parts of the score autonomously. 

The composition of most styles of music is governed by rules. We show that by approach- 
ing the automation, analysis and verification of composition as a knowledge representation 
task and formalising these rules in a suitable logical language, powerful and expressive in- 
telligent composition tools can be easily built. 

This application paper describes the use of answer set programming to construct an 
automated system, named Anton, that can compose melodic, harmonic and rhythmic 
music, diagnose errors in human compositions and serve as a computer-aided composi- 
tion tool. The combination of harmonic, rhythmic and melodic composition in a single 
framework makes Anton unique in the growing area of algorithmic composition. 

With near real-time composition, Anton reaches the point where it can not only be 
used as a component in an interactive composition tool but also has the potential for live 
performances and concerts or automatically generated background music in a variety of 
applications. With the use of a fully declarative language and an "off-the-shelf" reasoning 
engine, Anton provides the human composer a tool which is significantly simpler, more 
compact and more versatile than other existing systems. 

To appear in Theory and Practice of Logic Programming (TPLP) 

KEYWORDS: Answer set programming, applications of ASP, music composition, algo- 
rithmic composition, harmonic and melodic composition, diagnosis 



1 Introduction 

Music, although it seeks to communicate via emotions, is almost always governed by 
complex and rigorous rules which provide a foundation on which artistic expression 
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can be based. In the case of musical composition, in most styles there are rules 
which describe the progression of a melody, both at the local level (the choice of 
the next note) and at the global level (the overall structure). Other rules describe 
the harmony, which arises from the relationship between the melodic line and the 
supporting instruments. A third set of rules determines the rhythm, the intervals 
between notes, of a piece. 

These rules were developed to guide and support human composers working in 
the style of their choice, but we wish to demonstrate here that by using knowledge 
representation techniques, we can create a computer system that can reason about 
and apply compositional rules. Such a system will provide a simple and flexible 
way of composing music automatically. Provided that the representation technology 
used is sufhciently flexible to allow changes at the level of the rules themselves, the 
system will also help the human composer to understand, explore and extend the 
rules (s)he is working with. 

Since the beginning of recorded time, music composers have used a number of 
processes to generate the next note, be it simple scales or arpeggios, or complex 
mathematical structures. The interest in developing computational systems for com- 
posing, harmonising and accompanying music is not new either. Researchers have 
used a variety of mechanisms in searching for a viable system, including encodings 
of the stochastic and symbolic music of Xenakis (1992), and attempts to find sim- 
pler schemes to the major work in artificial intelligence on the harmonisation of 
Bach chorales by Ebcioglu (1986). 

This paper describes Anton, an automatic composition system capable of simple 
melodies with accompaniment, in particular for species one counterpoint as prac- 
tised in the early Renaissance. The insights gained from this system can be and are 
being extended to other musical styles, by adding or changing the rules. What has 
impressed us has been particularly the ease in which musical experience could be 
converted into code which is succinct and easy to verify. 

Anton uses Answer Set Programming (ASP) (Gelfond and Lifschitz 1988), a 
logic programming paradigm, to represent the music knowledge and rules of the 
system. A detailed description is provided in Section [3] after a short description of 
the musical aspects of the project (Section [2]). 

The initial system, Anton v. 1.0, was first presented in (Boenn et al. 2008 ). In this 
paper, we present Anton v. 1.5. The simplicity of the basic encoding is presented in 
Section[4j As we will demonstrate, the new version excluding rhythm is significantly 
faster allowing Anton to be used in real-time application rather than just as an 
interactive tool. 



The initial system ( Boenn et al. 2008 ) , has only an extremely simple concept of 
rhythm, all notes having the same length. In Section [5] we show how this musical 
restriction has been relaxed, allowing interesting and "correct" rhythmic patterns. 
The paper is completed with a discussion of the performance, both musical and 
computational in Section[6] the use of ASP in Section[7]and future work in SectionjS] 

Our overall aim is multi-faceted; on the one hand we want develop musicological 
ideas, create music, and test musical thoughts and on the other can use the system 
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to test the quality and utility of ASP solvers in real-world application^ We also 
note that Anton is usable as part of a student marking system, checking that 
harmonisations fit the rules. Alternatively, the system can be used as a diagnostic 
and assisted composition tool, where a part of the piece is given to the program to 
be correctly completed. 



2 Music Theory 

Music is a world-wide phenomenon across all cultures. The details of what con- 
stitutes music may vary from nation to nation, but it is clear that music is an 
important component of being human. 

In the work of this paper we are concentrating on western traditional tonal musics, 
but the underlying concepts can be translated to other traditions. The particular 
area of interest here is composition; that is creating new musical pieces. 

Creating melodies, that is sequences of pitched sounds, is not as easy as it sounds. 
We have cultural preferences for certain sequences of notes and preferences dictated 
by the biology of how we hear. This may be viewed as an artistic (and hence 
not scientific) issue, but most of us would be quick to challenge the musicality 
of a composition created purely by random whim. Students are taught rules of 
thumb to ensure that their works do not run counter to cultural norms and also 
fit the algorithmically definable rules of pleasing harmony when sounds are played 
together. 

"Western tonal" simply refers to what most people in the West think of as "clas- 
sical music", the congenial Bach through Brahms, music which feels comfortable 
to the modern western ear because of its adherence to familiar rules. Students of 
composition in conservatoires are taught to write this sort of music as basic train- 
ing. They learn to write melodies and to harmonise given melodies in a number of 
sub-versions. If we concentrate on early music then the scheme often called infor- 
mally "Palestrina Rules" is an obvious example to used for such a task. Similarly, 
harmonising Bach chorales is a common student exercise, and has been the subject 
of many computational investigations using a variety of methods. 

For the start of this work we have opted to work with Renaissance Counterpoint. 
This style was used by composers like Josquin, Dufay or Palestrina and is very 
distinct from the Baroque Counterpoint used by composers like Bach, Haendel. 

We have used the teaching at one conservatoire in Koln to provide the basic 
rules, which were then refined in line with the general style taught. The point about 
generating melodies is that the "tune" must be capable of being accompanied by 
one or more other lines of notes, to create a harmonious whole. The requirement for 
the tune to be capable of harmonisation is a constraint that turns a simple sequence 
(a monody) to a melody. 

In this particular style of music complete pieces are not usually created in one 



^ This paper is not intended to be a benchmarking paper for the various solvers and their numerous 
option. In this paper we simply wish to demonstrate that ASP is an appropriate paradigm for 
music composition and that composition can be done in near real-time. 
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go. Composers create a number of sections of melody, harmonising them as needed, 
and possibly in different ways, and then structuring the piece around these basic 
sections. Composing between 4 bars and 16 bars is not only a computationally 
convenient task, it is actually what the human composer would do, creating com- 
ponents from which the whole is constructed. So although the system described 
here may be limited in its melodic scope, it has the potential to become a useful 
tool across a range of sub-styles. 



2.1 Automatic Composition 

A common problem in musical composition can be summarised in the question 
"where is the next note coming from?" . For many composers over the years the 
answer has been to use some process to generate notes. It is clear that in many 
pieces from the Baroque period that simple note sequences are being elaborated in 
a fashion we would now call algorithmic. For this reason we can say that algorithmic 
composition is a subject that has been around for a very long time. It is usual to 



credit Mozart's Musikalisches Wiirfelspiel (Musical Dice Game) (Chuang 1995) as 
the oldest classical algorithmic composition, although there is some doubt if the 
game form is really his. In essence the creator provides a selection of short sections, 
which are then assembled according to a few rules and the roll of a set of dice to 
form a Minue10 Two dice are used to choose the 16 minuet measures from a set of 
176, and another die selects the 16 trio measure^ this time from 96 possibilities. 
This gives a total number of 1.3 x 10^^ possible pieces. This system however, while 
using some rules, relies on the coherence of the individual measures. It remains a 
fun activity, and recently web pages have appeared that allow users to create their 
own original(ish) "Mozart" compositiontj^ 

In the music of the second Viennese school ("12-tone", serial music) there is 
a process of action, rotating, inverting and use of retrograde, but usually this is 
performed by hand. 

More recent algorithmic composition systems have concentrated on the genera- 
tion of monod}]^ either from a mathematical sequence, chaotic processes, or Markov 
chains, trained by consideration of acceptable other works. Frequently the systems 
rely on a human to select which monodies should be admitted, based on judgement 
rather than rules. Great works have been created this way, in the hands of great 



talents. Probably the best known of the Markov chain approach is Cope (2006 )'s 
significant corpus of Mozart pastiche. 

In another variation on this approach, the accompanist, either knowing the chord 
structure and style in advance, or using machine-listening techniques, infers a style 



^ A dance form in triple time, i. e. with 3 beats in each measure 
^ A Trio is a short contrasting section played before the minuet is repeated 
* For example http://www.softsynth.com/jsyii/excmiples/dicegaine/ 
http: //jmusic . ci . qut . edu. au/jmtutorial/MozartDiceGeune .html 
http: //www. amaranthpublishing. com/Hozar tDlceGame ■ htm[ 
http : //imagine . xs4all . nl/bram/mozart 

http: //magyar-irodalom. elte .hu/kirnberger/_html/mozart_e .html 

^ A monody is a single solo line, in opposition to homophony and polyphony 
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of accompaniment. The former of these approaches can be found in commercial 
products, and the latter has been used by some jazz performers to great effect, for 
example by George E. Lewis (2000). 

A more recent trend is to cast the problem as one of constraint satisfaction. 
For example PWConstraints is an extension for IRCAM's Patchwork, a Common- 
Lisp-based graphical programming system for composition. It uses a custom con- 
straint solver employing backtracking over finite integer domains. OMSituation and 
OMClouds are similar and are more recently developed for Patchwork's successor 
OpenMusic. A detailed evaluation of them can be found in (Anders 2007), where 
the author gives an example of a Ist-species counterpoint (two voices, note against 
note) after (Fux 1725) developed with Strasheela, a constraint system for music 
built on the multi-paradigm language Oz. Our musical rules implement the melody 
and counterpoint rules described by (Thakar 19901, which we find give better mu- 
sical results. 

One can distinguish between improvisation systems and composition systems. In 
the former the note selection progresses through time, without detailed knowledge of 
what is to come. In practice this is informed either by knowing the chord progression 
or similar musical structures (Brothwell and ffitch 2008), or using some machine 
listening. In this paper we are concerned with composition, so the process takes 
place out of time, and we can make decisions in any order. 

It should also be noted that these algorithmic systems compose pieces of music of 
this style in either a melodic or a harmonic fashion, and are frequently associated 
with computer-based synthesis. The system we will propose later in this paper is 
unique as it deals with both simultaneously. 



2.1.1 Melodic Composition 

In melodic generation a common approach is the use of some kind of probabilistic 
finite state automaton or an equivalent scheme, which is either designed by hand 
(some based on chaotic oscillators or some other stream of numbers) or built via 
some kind of learning process. Various Markov models are commonly used, but 
there have been applications of n-grams, genetic algorithms and neural nets. What 
these methods have in common is that there is no guarantee that melodic fragments 
generated have acceptable harmonisations. Our approach, described below is fun- 
damentally different in this respect, as our rules cover both aspects simultaneously. 

In contrast to earlier methods, which rely on learning, and which are capable 
of giving only local temporal structure, a common criticism of algorithmic melody 



( Leach 1999 ) , we do not rely on learning and hence we can aspire to a more global, 
whole melody, approach. Furthermore, learning is designed to work in one direction 
at the time which makes it hard to use in a partially automated fashion. Our systems 
is not bound to these limitation, so operations like "fill in the 4 notes between these 
sections" are not a problem for us. 

Researchers are also trying to move beyond experiments with random note gen- 
eration because the results are too lacking in structure. Predictably, the alternative 
of removing the non-determinism at the design stage (or replacing with a proba- 
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bilistic choice) runs the risk of 'sounding predictable' ! There have been examples 
of good or acceptable melodies created like this, but the restriction inherent in the 
process means it probably works best in the hands of geniuses. 

Our experience with this work made us realise how many acceptable melodies 
can be created with only a few rules, and as we add rules, how much better the 
musical results are. 



2.1.2 Harmonic Composition 

A common usage of algorithmic composition is to add harmonic lines to a melody; 
that is notes played at the same time as the melody that are in general consonant 
and pleasing. This is exemplified in the harmonisation of 4-part chorales, and has 
been the subject of a number of essays in rule-based or Markov-chain systems. 



Perhaps a pinnacle of this work is Ebcioglu (1986) who used early expert system 



technology to harmonise in the style of Bach, and was very successful. Subsequently 
there have been many other systems, with a range of technologies. A review of these 



is included in ( Rohrmeier 2006 ) 



Clearly harmonisation is a good match to constraint programming based systems, 
there being accepted rule^ It also has a history from musical education. 

But these systems all start with a melody for which at least one valid harmoni- 
sation exists, and the program attempts to find one, which is clearly soluble. This 
differs significantly from our system, as we generate the melody and harmonisation 
together, the requirement for harmonisation affecting the melody. 



3 Ansvi^er Set Programming 



Answer Set Programming (Baral 2003 Gelfond and Lifschitz 1988) is a declarative 



programming paradigm in which a logic program is used to describe the require- 
ments that must be fulfilled by the solutions of a certain problem. The answer 
sets of the program, usually defined through (a variant/extension of) the stable 
model semantics ( Gelfond and Lifschitz 1988[ ), are interpreted to the solutions of 
the problem. This technique has been successfully applied in domains such as plan- 



ning (Eiter et al. 2002 Lifschitz 2002), configuration and verification (Soininen 



and Niemela 1999), super-optimisation (Brain et al. 2006), diagnosis (Eiter et al 



1999), game theory (De Vos and Vermeir 1999) multi-agent systems (Baral and 



Gelfond 2000; 'Buccafurri and Gottlob 20021 De Vos and Vermeir 2004 Buccafurri 



and Caminiti 2005, ^Cliffe et al. 2006[), rea soning about biologic al networks ([Grell 



et al. 20061, voting theory (Konczak 2006), policy mechanisms (Mileo and Schaub 



2006), generation of phylogenetic trees (Erdem et al. 2006), evolution of language 



( Erdem et al. 2003 1 and game character descriptions ( Padovani and Provetti 2004 



There is a large body of literature on ASP: for in-depth coverage see ( Baral 2003 



but for the sake of making this paper self-contained we will cover the essentials as 
they pertain to our usage here. 



^ For example see: http: //www. wikihow. com/HcLrmonise-a-Chorale- in- the- Style- of -Bach 
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Basic Concepts: The answer set semantics is a model based semantics for normal 
logic programs. Following the notation of (Baral 2003), we refer to the language of 
normal logic programs under the answer set semantics as AnsProlog. 

The smallest building block of an AnsProlog program is an atom or predicate, 
e.g. r(X,Y) denotes XrY. e.g owns{X,Y) stating that X owns Y. X and Y are 
variables which can be grounded with constants, e.g. owns(alice,key). Each ground 
atom can be assigned the truth value true or false. 

In this paper we will only consider programs with one type of negation, namely 
negation-as-failure denoted not. This type of negation states that something should 
be assumed false when it cannot be proven to be true. A literal is an atom a or its 
negation not a, with not not a ^ a. We extend the notation to sets: not S is the 
set {not I \ I G S} with S a set of literals. 

An AnsProlog program consist of a finite set of statements, called rules. Each 
rule r : a -s— i?. or _L -i— _B. is made of two parts namely the body B, denoted 
Br, which is a set of literals, and a head atom a or _L, denoted H^- The body can 
be divided in two parts: the set of positive atoms, denoted as Bf, and the set of 
negated atoms, denoted B^ . A rule should be read as: "a is supported if all elements 
of B are true" . A rule with empty body is called a fact and we often only mention 
the head. A rule with head _L is referred to as an (integrity) constraint. We often 
omit the _L symbol and leave the head empty. _L is always assigned the truth value 
"false" . A program is called positive if it does not contain any negated literals. 

The finite set of all constants that appear in the program P is referred to as the 
Herbrand Universe, denoted Up. Using the Herbrand universe, we can ground the 
entire program. Each rule is replaced by its ground instances, which can be obtained 
by replacing each variable symbol by an element of Up. The ground program, 
denoted ground{P), is the union of all ground instances of the rules in P. 

The set of all atoms grounded over the Herbrand universe of a program is called 
the Herbrand Base, denoted as Bp. These are exactly those atoms that will appear 
in the ground program. 

An assignment of truth values to all atoms in the program (or all elements from 
the Herbrand base) is called an interpretation. Often only those literals that are 
considered true are mentioned, as all the others are false by definition (negation as 
failure) . 

Given a ground rule r, we say a r is applicable w.r.t. an interpretation / C Bp if 
all the body elements are true (i?+ C / and B^ C\I — %). The rule is applied w.r.t. / 
when it is applicable and £ I. A ground rule is satisfied w.r.t. an interpretation 
/ if it is either not applicable or applied w.r.t. /. An atom is supported w.r.t. / if 
there is an applied rule with this atom in the head. Obviously, we want to make 
sure that interpretations satisfy every rule in the program. So, an interpretation / 
is a model for a program P if and only if all rules in ground{P) are satisfied. 

To find actual solutions, models alone are not sufficient: we need to make sure 
that only those literals that are supported are considered true. This results in the 
so-called minimal model semantics. A model M for a program P is minimal if no 
other model N exists such that N C M . Programs can have any number of minimal 
models, while programs without constraints will always admit at least one. Positive 
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programs will have at most one minimal model, and exactly one when they do not 
admit any constraints. 

The minimal model of a positive program without constraints can be found using 
a fixpoint, called the deductive closure, which can be computed in polynomial time. 
We start with the empty set and find all atoms that are supported. With this new 
set we continue to find supported atoms. When the set reaches a fixpoint, we have 
found the deductive closure or minimal model of the program. When the program 
contains constraints, we can follow the same principle but the process fails when a 
unsatisfied constraint is found. 

Definition 1 

Let P be a positive AnsProlog program and let / be an interpretation. We define 

the immediate consequence operator Tp as: 

Tp{I)^{aeBp\3reP- Br C /} 

Tp is monotonic so it has a least fixpoint, denoted Tp'^(0), which corresponds to 
the deductive closure. 

Definition 2 

Let P be a positive AnsProlog program. The deductive closure of P, is the least 
fixpoint Tp"(0) of Tp. 

The minimal semantics is sufficient for positive programs, but fails in the presence 
of negation-as- failure. A simple example of such a program is: {a not a.; b ^ a}. 
This program has one minimal model {a, 6}, while the truth of a depends on a be- 
ing false. To obtain intuitive solutions, we need to verify that our assumptions are 
indeed correct. This is done by reducing the program to a simpler program contain- 
ing no negation-as-failure. Given an interpretation, all rules that contain not I that 
are considered false are removed while the remaining rules only retain their body 
atoms. This reduction is often referred to as the Gelfond-Lifschitz transformation 



(Gelfond and Lifschitz 1988 Gelfond and Lifschitz 1991 ). When this program gives 



the same supported literals as the ones with which we began, we have found an 
answer set. 

Definition 3 

Let P be a ground AnsProlog program. The Gelfond-Lifschitz transformation of 
P w.r.t a set of ground atoms S", is the program P^ containing the rules Hr P,t 
such that B+, not B' € P with P" n S" = 0. 



Definition 4 

Let P be a AnsProlog . A set of ground atoms S C Bp is an answer set of P iff S* 
is the minimal model of ground{P^). 

The non-deterministic nature of negation-as-failure gives rise to several answer 
sets, which are all acceptable solutions to the problem that has been modelled. It 
is in this non-determinism that the strength of answer set programming lies. 
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Extensions The basic language, single head atom and negation-as-failure only ap- 
pearing in the body, already enables the representation of many problems. However, 
for some applications the programmer is forced to write code in a more round-about 
non-intuitive way. To overcome this, several extensions were introduced. 



For this paper we use one of these extensions: choice rules (Niemela et al. 19991 



A lot of problems require choices to made between a set of atoms. Although this 
can be modelled in the basic formalism it tends choice rules are offer more clarity 
of expression and are more convenient. Choice rules, written L{li, . . . , are a 

convenient construct to state that at least L and at most M from the set {Zi, . . . , Z„} 
must be true in order to satisfy the construct. L defaults to when omitted while 
M defaults to n. Choice rules are often used in conjunction with a grounding 
predicate: L{A{X) : B{X)}M represents the choice of a number of atoms A{X) 
where is grounded with all values of X for which B{X) is true. 



Implementations: Algorithms and implementations for obtaining answer sets of 
logic programs are referred to as answer set solvers. The most popular and widely 



used solvers are smodels ( Niemela and Simons 1 997[) and dlv (jEiter et al. 1998|) 
and more recently CLASP ( Gebser et al. 2007| . 



Alternatives are CMODELS ( Giunchigha et al. 2004 ) and SUP ( Lierler 2008 1, solvers 



based on translating the program to a SAT problem, and SMODELS-ie ( Brain et al. 



2007), the cache-efficient version of SMODELS. Furthermore, there is the distributed 



solver PLATYPUS ( Gressmann et al. 2005 ) 



To solve a problem it first needs to be grounded. Currently three grounders 
are used: the grounder integrated with dlv, lparse, the grounder that was de- 
veloped together with SMODELS but is used by most solvers, and the most recent 



one GrinGo (Gebser et al. 2007) which works together with clasp and other 



solvers that take LPARSE output. During the grounding phase, not only are the 
variables substituted for constants, but also useless rules are eliminated. Further- 
more, grounders try to simplify the program as much as possible. The second phase 
is solving which takes a grounded program or its internal representation as input 
and generates the set of its answer sets. 

Current answer set solvers can be divided in three groups depending on the style 
of algorithm they use or the mapping the use: depth first search (DPLL) clause 
learning and a mapping to SAT. All use a variety of heuristics to improve the 
performance of the basic algorithm. 



4 Anton 

4-1 System Description 

Anton is an algorithmic composition system that uses ASP techniques to represent 
and reason about compositional rules. AnsProlog is used to write a description of 
the rules that govern the melodic and harmonic properties of a correct piece of 
music; in this way the program describes a model of musical composition that can 
be used to assist the composer by suggesting, completing and verifying short pieces. 
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'/J, The number of parts is given by the style #doraain part CP) . 

X'/t Each part picks at most one note per time step 

ii'/t the length of the sequence is provided at run-time 

timeCl . .t) . 

#domain timeCT). 

%°/o Each part can only play one note at a given time 
:- 2 { chosenNoteCP.T.KN) : note CNN), restCP.T) }. 

%°/o At every time step the note may change 

°/o°/t It changes by stepping (moving one note in the scale) 

%'/( or leaping (moving more than one note) 

ii'/t These can either be upwards or downwards 

1 { changes(P,T) , repeated CP ,T) , toRest(P,T), f romRest (P ,T) , 

incorrectProgression(P,T) } 1 :- T != t. 
1 { stepAt(P,T), leapAtCP.T) } 1 :- changes (P , T) , T != t. 
1 { dounAt(P,T), upAt(P,T) } 1 :- changes CP ,T) , T != t. 

stepDownCP.T) :- stepAt(P,T), downAtCP.T). 
stepUpCP.T) :- stepAtCP.T), upAtCP.T). 

#const err_ip=" Incorrect progression". 
reasonCerr_ip) . 

error CP, T, err _ip) :- incorrectProgressionCP ,T) . 

'/o'/i If we step, we must pick an amount to step by 

1 { stepByCP,T,SS) : stepSizeCSS) : SS < } 1 : - stepDownCP ,T) . 

1 { stepByCP,T,SS) : StepSizeCSS) : SS > } 1 : - stepUpCP,T). 

'/,'/( Make it so 

chosenNoteCP,T + 1,N + S) :- chosenNote CP ,T, N) , stepAtCP,T), stepBy CP , T , S) , noteCN + S) . 

chosenNoteCP,T + 1,N + L) :- chosenNote CP ,T, N) , leapAtCP,T), leapBy CP , T,L) , noteCN + L) . 

chosenNoteCP,T + 1,N) :- chosenNote CP ,T, N) , repeatedCP , T) . 



Fig. 1. A code fragment from progression. Ip 



The composition rules are modelled so that the AnsProlog program defines the 
requirements for a piece to be musically valid, and thus every answer set corresponds 
to a different valid piece. To generate a new piece the composition system simply 
has to generate an (arbitrary) answer set. 

In this section we will discuss the basic system of Anton v. 1.5. The handling of 
rhythm will be discussed in Section [s] page 15 



The AnsProlog program of the basic system is created from various files: notes . Ip, 
modes . Ip, progression . Ip, melody . Ip, harmony . Ip and chord . Ip. The first con- 
tains the general background rules on notes and intervals while the second describes 
the various modes/keys the system can use and their consequences for note selection 
and position. The current system is able to work with major, minor, Dorian, Ly- 
dian and Phrygian modes. Rules for the progression of all parts, either melodic and 
harmonic, are handled in progression. Ip. This part of the program is responsible 
for selecting the next note in each of the parts on the basis of the previous note. 
The rules for the melodic parts and for composing with multiple parts are encoded 
in melodic. Ip and harmonic. Ip respectively, chord. Ip provides the description 
of chords and chordal progression and the effects of note choices. 

We will discuss progression, melody and harmony in more detail. The whole 
system is licensed under the GPL and publicly availablf]^ 

Figure [T] presents a selection of rules dealing with the progression of notes. The 



The source code is available from jhttp : //www. cs .bath. ac .uk/-mjb/anton/ 
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7,7, Melodic parts are not allowed to repeat notes 
#const err_nrmp="Ko repeated notes in melodic parts" . 
reasonCerr_nrmp) . 

error (MP, T, err _nrmp) :- repeated(MP,T) . 

7,7, A leap of an octave is only allowed from the fundamental. 

#const err.olnf ="Leap of an octave from a note other than the fundamental". 

reasonCerr_olnf ) . 

error(MP,T,err_olnf ) :- leapBy (MP.T, 12) , not chosenChromatic(MP,T, 1) . 
error(MP,T,err_olnf ) :- leapBy (MP ,T,-12) , not chosenChromaticCHP,T, 1) . 

7,7, Impulse 

7,7, Stepwise linear progression creates impulse 

7,7, Leaps create impulse - using the notes in between resolves this 
downwardImpulseCHP,T+l) :- leapDownCMP.T) , time(T+l) . 

downwardImpulseCHP,T+3) :- stepDown(NP ,T+2) , stepDownCMP ,T+1) , stepDownCHP.T) , timeCT+3) . 
upwardImpulse(MP,T+l) :- leapUp(MP ,T) , time(T+l) . 

upwardImpulse(MP,T+3) :- stepUp(MP,T+2) , stepUp(MP,T+l) , stepUpCMP.T) , timeCT+3). 

7,7, No repetition of two or more notes 
#const err_rn="R6peated notes" . 
reasonCerr_m) . 

error (MP, Tl,err_rn) : - chosenKote(MP,Tl ,N) , stepBy(MP,Tl ,S1) , 

chosenNote(MP,T2,N) , stepBy(HP,T2,31) , 

Tl + 1 < T2, T2 < Tl + 2 + RW. 
error (MP, Tl,err_rn) : - chosenNote(MP,Tl ,N) , leapBy(MP.Tl ,L1) , 

chosenNote(MP,T2,N) , leapBy(MP,T2,Ll) , 

Tl + 1 < T2, T2 < Tl + 2 + RW. 
#const err_dc= "Dissonant contour" . 
reason(err_dc) . 

error(MP,t,err_dc) :- lowestNote (HP.Nl) , highestNote(MP,N2) , 
chromaticCNl,Cl) , chromatic(N2,C2) , 
not consonant(Cl,C2) , Nl < N2. 

Fig. 2. A code fragment from melody. Ip 

model is defined over a number of time steps, given by the variable T. The key 
proposition is chosenNote(P,T,N) which represents the concept "At time T, part 
P plays note N" . To encode the options for melodic progress ( "the tune either steps 
up or down one note in the key, leaps more than one note, repeats or rests"), 
choice rules are used. For diagnostic and debugging purposes, compositional er- 
rors are not immediately encoded as constraints, but instead use error rules like 
error (P,T, err _ip) :- incorrectProgression(P,T) . By using a constraint to 
obtain answer sets with error-atoms or excluding them entirely, we can alter the 
functionality of our system from diagnosis to composition without changing the 
code. We will later return to various applications of our system. 

To encode the melodic limits on the pattern of notes and the harmonic limits 
on which combinations of notes may be played at once, error-rules like the one in 
progression . Ip are included. Figure [2] shows how we encoded rules that forbid rep- 
etition of notes in the melodic parts, octave leaps except for special circumstances, 
impulses, repetition of more than two notes and certain lengths of intervals. While 
some of these rules might be valid in other types of music, Renaissance counterpoint 
explicitly forbids them. 

Interaction between parts is governed by the rules of harmony. Figure [3] shows 
how we encoded the musical rules that specify that you cannot have dissonant 
intervals between parts, that limit the distance between parts and that state that 
parts cannot cross-over. 

While the fragments shown in Figures [T]|3] are a subset of the knowledge base, they 
demonstrate that the rules are very simple and intuitive (with the necessary musical 
background). The modelling of this style of music, excluding rhythm, contains less 
than 400 ungrounded logic rules. 
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#const err_dibp="Dissonant interval between parts", 
reason (err _dibp) . 

errorCPl,T,err_dibp) :- chosenChromatic (PI ,T, CI) , chosenChromaticCP2,T,C2) , 
PI < P2, chroinaticIntervalCCl,C2,D) , 
not validlntervalCD) . 

%'/( The maximum distance between paxts is an octave plus 4 semitones (i.e. 16 semitones). 
#const err_mdbp="Over maximum distance between parts", 
reason (err _mdbp) . 

errorCP,T,err_mdbp) :- chosenNoteCP,T,Ml) , chosenNote (P+l ,T,N2) , 
Nl > M2 + 16, partCP+1) . 

°/„°/o Parts CcLnnot cross over, 

#const err_pcc="Parts can not cross" . 

reason (err _pcc) . 

errorCP,T,err_pcc) :- chosenNoteCP,T,Nl) , chosenNote (P+l ,T,N2) , 
Nl < N2, partCP+1) . 

Fig. 3. A code fragment from harmony. Ip 



y,% This is a quartet 
style (quartet) . 

7(71 There are four parts 
partCl. .4) . 

7(71 The top part plays the melody 
melodicPart (1) . 

7i% For chords we need to know the lowest part 
lowestPartC4) . 

7(71 We need a range of up to 2 octaves (24 steps) for each part, 

7(71 thus need 24 notes above Eind below the lowest / highest start 

#const quartetBottomNote=l . 

#const quartetTopNote=68 . 

note (quartetBottomMote . . quartet TopNote) . 

bottomNote CquartetBottomNote) . 

topNote (quartet TopNote) . 

7i7« Starting positions are 1-5-1-5 
#const err_isn="Incorrect starting note". 

reason(err_isn) . errorCl , 1 ,err_isn) :- not chosenNote (1 , 1 ,44) . 
error (2,1, err _isn) :- not chosenNote (2 , 1 , 37) . 
error (3,1, err _isn) :- not chosenNote (3 , 1 , 32) . 
error (4,1, err _isn) :- not chosenNote (4, 1 , 25) . 

7»7. No rests 

#const err_nrfw="No rest for the wicked". 

reason(err_nrfw) . 

error (P,T, err _nrfw) :- rest(P,T). 

'/>% With three or more parts allow intervals of a major fourth 
'/>% (5 semitones) between parts 
validlnterval(5) . 

Fig. 4. The quartet specification 



4^.2 Features 

In the previous section we discussed the basic components of the Anton system. 
However, in order to have a complete system, we are stih missing one component: 
the specification of parts. Currently, the system comes with descriptions for solos, 
duets, trios and quartets but the basic system is written with no fixed number of 
parts in mind. Figure [4| shows the description for a quartet. 

Depending on how the system is used, for composition or diagnosis, one will 
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I 55 60 48 50 48 60 59 62 55 57 53 55 48 52 47 48 
I GCCDCCBDGAFGCEBC 
I +5 -12 +2 -2 +12 -1 +3 -7 +2 -4 +2 -7 +4 -5 +1 

Fig. 5. The score and human readable format for the tunes composition. 

either be interested in those pieces that do not result in errors at all, or in an 
answer set that mentions the error messages. For the former we simply specify the 
constraint : - error (P,T,R) . , effectively making any error rule into a constraint. 
For the latter we include the rules: errorFound :- error (P, T, R) . and :- not 
errorFound. requiring that an error is found (i.e. returning no answers if the 
diagnosed piece is error free). These simple rules are encoding in composing. Ip and 
diagnosis . Ip so that they can be included when our scripts assemble to program. 

By adding constraints on which notes can be included, it is possible to specify a 
part or all of a melody, harmony or complete piece. This allows Anton to be used 
for a number of other tasks beyond automatic composition. By fixing the melody it 
is possible to use it as an automatic harmonisation tool. By fixing part of a piece, 
it can be used as computer aided composition tool. By fixing a complete piece, it is 
possible to check its conformity with the rules, for marking student compositions 
or harmonisations. Alternatively we could request the system to complete part of 
a piece. 

The complete system consists of three major phases; building the program, run- 
ning the AnsProlog program and interpreting the results. As a simple example 
suppose we wish to create a 4 bar piece in E major. One would write: 

programBuilder.pl — task=compose — mode=maj or — time=16 > program 

which builds the AnsProlog program, giving the length and mode. Then 

gringo < program I ./shuffle. pi 6298 I clasp 1 > tunes 

runs the solving phase and generates a representation of the piece. We provide a 



number of output formats, one of which is a CSOUND (Boulanger 2000) program 
with a suitable selection of sound templates. 

$ parse.pl — f undamental=e — output=csound < tunes > tunes. csd 
generates the Csound input from the generic format, and then 
$ csound tunes . csd -o dac 

plays the melody. We provide in addition to Csound, output in human readable 
format, AnsProlog facts or the Lilypond score language. Figure [5] shows the score 
of the tunes piece composed above. 

Alternatively we could request the system to complete part of a piece. In order to 
do so, we provide the system with a set of AnsProlog facts expressing the mode, the 
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notes which are already fixed, the number of notes in your piece, the configuration 
and the number of parts. 

Figure [6] contains an example of such file. 

We then use the system as before with the exception of adding — piece=musing . Ip 
when we run programBuilder.pl. The system can then return all possible valid 
composition that satisfy the criteria set out in the partial piece. 

Anton is not limited to the composition of music; it can also be used ver- 
ifying whether a (partial) piece of music adheres to the rules of compositions. 
Take for example the composition in Figure [7] Using the command-line options 
— task=diagnosis — piece=problems . Ip when we run programBuilder .pi, the 
system will return the error codes for all the rules that the composition breaks. In 
case of problems . Ip, we obtain: 

error (1 ,2 , "Repeated pattern") error (1 ,4, "Repeated pattern") 
error(l,2,"Split melody") 

The first two errors indicate that we have repeated patterns: e.g. from the second 
note we leap up by a perfect fourth and step down by a major second. This pattern 
is then repeated starting at the fourth and sixth notes. The repeated progression 
pattern also triggers the split melody error. Split melodies occur when the even/odd 
notes form separate melodies. 

The AnsProlog programs used in Anton v. 1.5 , contains just 509 lines (not in- 
cluding comments and empty lines) and encodes 44 melodic, harmonic and rhythmic 
rules. Once instantiated, the generated programs range from 1,000 atoms and 5,500 
rules (a solo piece with 4 notes) to 22,000 atoms and 796,000 rules (a 32 note quar- 
tet). It should be noted that our 1500 lines of code, AnsProlog rules and perl scripts 



combined, contrast with the 8000 lines in Strasheela (Anders 20071 and 88000 in 



Bol dBel 1998 ). 

The question of problem complexity is still open. Clearly there is an NP algorithm 
for generating a piece that meets a set of musical rules and there is no known 
polynomial algorithm for this task. Depending on what constitutes a valid musical 
style, it is likely that the task can be shown to be NP-Complete. Verifying if a certain 
composition satisfies all the musical rules is linear. However from an applications 
standpoint, the exact complexity of the process is of less importance than the run 
time performance. 



keyMode(lydian) . 
chosenNote(l,l,25) . 
chosenNote(l,2,24) . 
chosenNote(l,8,19) . 
chosenNote(l,9,20) . 
chosenNote(l,10,24) 
chosenNote(l,14,29) 
chosenNote(l,15,27) 
chosenNote(l,16,25) 
#const t=16. 
style (solo) . 
part(l) . 

Fig. 6. musing. Ip: An example of a partial piece 
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mode (major) . 
chosenNote(l,l,25) . 
chosenNote(l,2,24) . 
chosenNote(l,3,29) . 
chosenNote(l,4,27) . 
chosenNote(l,5,32) . 
chosenNote(l,6,30) . 
chosenNote(l,7,34) . 
chosenNote(l,8,32) . 
chosenNote(l,9,34) . 
chosenNote(l,10,39) . 
chosenNoteC 1,1 1,37) . 
chosenNote(l,12,34) . 
ChosenNoteC 1,13, 36) . 
ChosenNoteC 1,14, 37) . 
#const t=14. 
style Csolo) . 
partCl) . 

Fig. 7. problems . Ip: An example of a broken piece 



5 Rhythm 

Our system used to be limited in terms of one of the most essential musical param- 
eters: rhythm. All music Anton v. 1.0 generated was based on rules for classical 
polyphony, i.e. the combination of musically unique, independent melodic voices, 
but all events had the same time interval. Within this restriction we are able to 
generate first-species counterpoint up to four independent voices and solo melodies 
as well as homophonic 4-part chorales where the rules for the inner voices could be 
more relaxed from the strict melodic rules that determine each of the four voices 
in a polyphonic setting. 

Rhythm determines at what point in time a particular note in a sequence is 
played, for how long it is played and how much stress this note will get. The 
musical experience is partly determined by the constant inter-change of impact 
and resolution. Both influence the behaviour of musical parameters on micro- and 
macro-structural levels and are expressed within the rhythm of the piece. 

An alternative characterisation of rhythm is as the interplay between the duration 
of notes. The relation between notes can then be expressed in terms of interval 
relationships Allen (1983). Thus temporal interval logic Goranko et al. (2003| ) style 



reasoning can be used to describe a constructive model of rhythm. Our experience 
has shown that access to the full range of interval relations is unnecessary and 
difficult to encode, thus Anton uses a simpler mechanism, that collects many of 
the relationships into one "overlaps" literal. This is suflticient for the rhythms of our 
musical genre and it is this simpler form that we describe here. 

Our representation of rhythm is closely related to Farey sequence. The Farey 
sequence of order n can defined as the sequence of reduced fractions in the range 
[0,1], when in lowest terms, have denominators less than or equal to n, arranged in 
order of increasing size. The n^^ term in the sequence is the set of rational numbers 
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with n as denominator. An example the sequence of order 8 is 

01111121323143525345671 
* ~ 1' 8' 7' 6' 5' 4' 7' 3' 8' 5' 7' 2' 7' 5' 8' 3' 74' 5' 6' 7' 8' T 
The related concept we will use is the filtered Farey sequence, which is a systematic 
filtering of this sequence. In particular we will show how it is used in the descrip- 
tion of rhythm in the next subsection, and indicate the constructive programming 
aspects in Section [5^ 



5.1 Musical Discussion of Rhythm 

The musical areas of harmony and classical counterpoint often have rules about 
the conditions when a particular constellation of notes may occur on the timeline, 
sometimes with inclusion of the events preceding and following. For example, a 
suspended fourth is a dissonant constellation of two or more voices created on a 
strong beat. The voice that suspends the consonant note, i.e. the fourth note of 
the scale suspending the third, needs to be sounding before the dissonance occurs. 
This sound needs also to be consonant. After the dissonance on the strong beat the 
suspending voice needs then to resolve on a weak time interval within the measure. 
That sound again must be a consonance. To help expressing such rules the interval 
relations of 'start', 'end', 'during' and 'overlap' are needed. In music, however, 
the time intervals themselves are never 'neutral'. There are typical alternations of 
strong and weak beats in nearly every musical style that is based on an underlying 
pulse. Those alternating beat patterns form the notion of meter. But, the concept 
of meter in Renaissance style is a particular one and not the same as the concept of 
bars, beats and time signatures that is prevailing since the Baroque era and which 
is still in use today. 

There are generally four different kinds of musical time in the Renaissance. When 
looking at the subdivision of the brevis, which translates into today's double whole- 
note, there are four different options for the composer of that documented 
in the famous treatise 'Ars Nova' by Philippe de Vitry. The brevis can be inter- 
preted either as 3 or 2 semi-breve (today's whole note) and those further into 3 and 
2 subdivisions called minims (today's half note) , so we can subdivide the longa into 
3x2, 2x2, 3x3 or 2x3 minim^ Those different subdivisions were indicated 
in the vocal score using different time-signaturefj^ The 15th century composer Jo- 
hannes Ockeghem wrote his Missa prolationem, a four-voiced polyphonic master- 
piece, using all four different time-signatures together, a different one for each voice, 
while at the same time the voices imitate their lines containing the same melodic 
material using canons and double-canon This complex polyrhythmic structure 



® The order of subdivision is musically important in terms of accentuation and therefore has 
consequences for the treatment of consonance and dissonance. Hence, 3 x 2 is not the same as 
2x3. 

® The four time signatures are called tempus perfectum cum prolationem imperfecta (3 x 2), 
tempus imperfectum cum prolationem imperfecta (2x2), tempus perfectum cum prolationem 
perfecta (3 x 3) and tempus imperfectum cum prolationem perfecta (2 x 3) 

" A canon is a special case of a line that can be imitated simultaneously by another voice n beats 
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'X' marks a downbeat, 'O' marks a 2nd level beat, 'o' marks a 3rd level beat 
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Table 1. Polyrhythmic structure of downbeats from Ockeghem's Missa prolationem 
creating a hyper-meter of 36 beats 

Tempus perfectum cum prolatione imperfecta - 6 measures x 3 x 2 half notes 
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Table 2. Metrical hierarchy from Ockeghem's Missa prolationem expressed as Fgg 



establishes a proportion of note durations as 6:4:9:6 between the voices. The effect 
of superimposing different beat qualities can be seen in Table [l] Only the smaller 
note durations below the minim are not affected by the proportional scaling. They 
remain at the same length for all four voices regardless of their time-signature. 
When transcribed nowadays into common practice notation the voices would need 
36 half notes to complete one large cycle of accent patterns before starting over with 



a common 'downbeat' (de la Motte 1981). The complete cycle therefore translates 
into the filtered Farey Sequence F^q, where each of the ratios in the range of [0...1] 
denotes the onset time of a half note; see the representation for one voice, the Bass, 
in Table [2j According to the time-signature (3 x 2) of that particular voice there are 
three distinct metrical layers that govern the note events occurring on such a grid: 
The top layer gives us the occurrence of the downbeat, which is the event of most 
metrical importance, i.e. no dissonance may occur here unless the voice causing it 
prepares the note on the preceding beat. The second layer denotes the beat level. 
Although of less metrical weight, no dissonant interval should sound here unless it 
is a prepared suspension. The third layer provides the lightest metrical events. Here 
dissonances may occur, for instance in form of a passing note. The rule demanding 
preparation of the dissonance is relaxed on this level and on all further subdivisions 
underneath. 

The hierarchical pattern of one of Ockeghem's voices in Table [2] shows also some 



later while their combination at the same time satisfies the rules on consonance/dissonance. 
Adding support for more sophisticated interval relations in future versions of ANTONwill allow 
us to compose canons. 
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interesting properties that lead to a general method for the construction of such 
tables. Starting with the first metrical level (I) in Table [2j we see that the largest 
denominator is always equal to the number of measures required to come back to 
square one with all the other voices, i.e the number of measures per hyper-meter. 
All the other denominators on level I are divisors of the largest denominator. The 
next level (II) shows us how many subdivisions are contained in the hyper-meter on 
that particular level. This is again indicated by the largest denominator of the level 
and all other denominators are his divisors excluding those already contained on 
all lower indexed levels. This principle repeats itself on all higher indexed metrical 
levels. For each denominator n in the scheme, the numerators also follow a simple 
principle: they traverse the complete ordered list of numbers co-prime to n. 



As we have shown in Boenn (2007) and Boenn (20081, the Farey Sequence is ideal 



for tasks like rhythmic modelling, music performance analysis and music theory. 



Chapter 3 of Hardy and Wright (1938 ) gives a description and proves the properties 
of the Farey Sequence. Its scalability and general independence from the concepts 
of bars and meter is of advantage because it can be applied to numerous different 
musical styles. We have given a glimpse in the above Renaissance example (Table 
[2]) how it could be used to encode polyrhythmic structures. Other examples can 
include African Polyrhythm, Western Classical, Avantgarde and Popular Music, 
Greek Verse Rhythms, Indian Percussion and many more. The principle remains 
always the same. 

The visualisation in Figure [8] clearly points out that the smaller the denominator 
the larger are the symmetrical gaps around the x-position of the ratio, i.e. the 
smaller b of the ratio a/b, the greater the distance. We believe that it is due to 
those relatively large gaps surrounding simpler ratios that they form perceptually 
useful zones of attraction for more complex ratios that fall into them or that come 
close enough. It is left for future field studies to measure and to find evidence 
whether these zones of attraction are perceptually relevant or not. Composers who 
want to "stay away" from those simple ratios will need to leave a considerable 
amount of space around these zones of attraction. It becomes also clear that there 
are various accelerandi and ritardandi encoded in every F„, for example there are 
gestaltf[^ that form visible triangles between larger reciprocals and smaller ones 
in their surrounding area (Figure [s]). These gestalts are formed by monotonically 
increasing or decreasing values of the denominators. The increasing or decreasing 
tendencies overlap each other. The gaps between the ratios forming those triangles 
are always on a logarithmic scale, hence the impression of accelerated or reduced 
tempo that becomes evident through the sonification of these triangular gestalts. It 
is well known that timed durations need to be placed on a logarithmic rather than 
linear scale in order to convey a "natural" sense of spacing and tempo modification. 
These structures are clearly mirrored around the 1/2 value that is part of every Fn 
with n > 1. 

The great variety of styles and concepts from medieval to modern eras can be 



That is the concept of shape or configuration of a whole perceived object 
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b 0.3 




Fig. 8. Correlation of a/b e Fu and 1/6 in the interval (0, 1) 



realised by applying intelligent filtering methods to sieve through the sequences. Ex- 
amples for filtering include probabilistic methods and filters exploiting the prime 
number composition of integers and ratios, for example b-smooth numbers, or 



Clarence Barlow's function for the 'Indigestibility' of a natural integer (Hajdu 1993 1 



The Farey Sequence has been known for a while in the area of musical tuning sys- 
temj^ Its use for rhythmic modelling has not been fully exploited yet; Anton is 
the first music application for composition where rhythms and musical forms will be 
generated on the basis of the principles outlined in this paper. The Farcy Sequence 
Fn is per se highly symmetrical and unfolds harmonic subdivisions of unity via a 
recursive calculation of mediant fractionj^ Every music that depends on an un- 
derlying beat or pulsation can be represented by using to denote the normalised 
occurrence of musical events, e.g. note onsets. 

Off-Beat rhythms are extremely useful for generative purposes. Two or more of 
these rhythms stacked in layers can always generate new combinations by using 
different accentuation patterns and different dynamic processes. Again, the Farey- 
Sequence proves to be a very useful structure to realise this concept. The details 



are beyond what is necessary in this paper, but in Boenn (2009 ) it is shown that 
many styles like Bebop, Funk and 20th century Avantgarde, are modelled by this 
mechanism. Speech rhythm, as used in various musical styles are also within the 
scope. 



Finally, Sima Arom's study (Arom 1991) on African Polyrhythms has been very 



For e xample Erv Wilson's annotations of tunings used by [Partch (1979| 

http://www.anaphoria.com/wilson.html 



See http:/ /mathworld. wolfram. com/l^'areySequence. html 
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influential on contemporary western composers because of his successful recording 
and transcription processes that form the basis of his further analysis. We are 
seeking to translate some of these principles into features for Anton for creative 
purposes but also in order to proof the general use of our concept. But this is for 
the future (Section [s]). 

The main message is that the Farey sequence contains all that is necessary for 
a very wide range of rhythmic patterns. With an encoding of them in AnsProlog 
we have all the infrastructure we need. 



5.2 Rhythm in Anton 

The question now arises how we can combine this generative model for rhythm 
with the rules for melody, counterpoint and harmony that have been already im- 
plemented in Anton. Of central importance for musical experiences in our view is 
the constant inter-change of impact and resolution that influences the behaviour of 
musical parameters on micro- and macro-structural levels. One can compare impact 



with gathering musical energy ( Thakar 1990 ) and resolution with the release of pre 



viously built-up energy. We were very careful to make sure that the melodic lines 
generated are following this principle. With the encoding of rhythm we have now 
the possibility to precisely control the timing aspect of when to turn the musical 
movement from impact to resolution. 

Rhythm in Anton v. 1.5 is the first prototype and an improved version is under 
development. As mentioned earlier, the encoding of rhythm is based on the concept 
of Farey sequences. One can view each rational number in the sequence as a way of 
partitioning the range [0,1], and as any particular partitioning can be constructed 
from a series of partitioning by a rational number with a prime denominator, the full 
collection of possible rhythms can be considered as all possible trees of partitions. 
In the initial version we limit the partition denominators to be taken to be only the 
primes 2 and 3, which allows a sufficiently rich subset of trees (and hence rhythms) 
for our musical genre. The encoding allows other primes, at some computational 
cost. 

While the Farey sequence is a useful form for traversing the rhythmic structure, 
the partitioning tree is a useful (computational) way of building up a filtered Farey 
sequence by using a top-down approach, starting with a section of time and working 
it down by way of making subdivisions. The filtering of the Farey sequence is then 
achieved through the constraint of allowing only small primes as subdividing num- 
bers within the tree, i.e. 2 and 3 in our initial implementation of rhythm. The further 
advantage of having a tree is that it allows us to have an easy access to different 
metrical levels (measure, beat, notes), which is vital for the later addition of rules 
about impact/resolution and for rules governing usage of consonance/dissonance. 
An example of a partitioning tree can be found in Figure |10[ Each part is divided 
in a number of measures, these forms the top layer of the tree (represented by rect- 
angles in the example below). Each measure is then divided in a number of beats 
(diamond shapes), which control the emphasis of notes (the weight of shading). The 
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°/o°/( Each Farey tree has a given depth 

depthCF,MD + BD + DD) :- measureDepthCMD) , beatDepthCF , BD) , durationDepthCF, DD) . 
level (F, 1. .DE) :- depth (F.DE) . 

°/o°/o Each Farey tree is divided into three layers (top to bottom) 

°/o°/o Measure, beats and note duration 

'/o'/t (bars, time signature and note value) 

measureLevelCF,FL) :- depth(F,DE), durationDepthCF, DD) , beatDepthCF,BD) , 

level (F,FL), FL <= DE - (DD + BD) . 
measureLeaf Level CF,DE - (DD + BD)) :- depth (F, DE) , durationDepth(F ,DD) , beatDepth(F,BD) . 

%°/o Map from nodes to time positions 

°/o°/o Mapping increments each time a node is present 

nodeStepCF,0,l) . 

nodeStepCF,ND,T) :- not present (F , DLL , MD) , nodeStep (F,ND-1 , T) , 

node(F,DLL,ND) , durationLeaf Level (F , DLL) , ND > 0. 
nodeStepCF,ND,T+l) :- present (F, DLL, ND) , nodeStepCF , ND-1 ,T) , 

node(F,DLL,ND) , durationLeaf Level CF , DLL) , ND > 0. 
%°/o From this we derive a unique mapping from node to time step 
timeToNode(P,l,0) . 

timeToNode(P,T,ND) :- present (F, DLL, ND) , nodeStepCF ,ND-1 ,T-1) , 

node(F,DLL,ND) , durationLeaf Level CF , DLL) , ND > 0, 
partToFareyTree (P , F) . 

%°/o Beat strength is created at the first level of the beat layer . . . 
nodeBeatStrength(F,MLL+l,ND2,l) :- measureLeaf Level (F,MLL) , node (F ,MLL , NDl) , 

descendant (F , , MLL , NDl , MLL+1 , ND2) . 
nodeBeatStrength(F,MLL+l,ND2,0) :- measureLeaf Level (F, MLL) , node (F , MLL , NDl) , 

descendant(F, D, MLL, NDl, MLL+1, ND2) , D != 0. 

%'/( Lowest and highest notes must also be slower 
playsHeighestNote(P,T) :- chosenNote(P,T,N) , lowestNote (P , N) . 
playsLowestNoteCP,T) :- chosenNote CP , T ,N) , lowestNote (P , N) . 

:- playsHeighestNote(P,T) , timeStepDuration(P , T ,DS) , DS > 1. 

:- playsLowestNoteCP,T) , t imeStepDurationCP ,T, DS) , DS > 1. 

Fig. 9. A small rhythm code fragment 

notes (circles) of the part are then grouped w.r.t. duration and placed within their 
respective beats. Individual notes are the leaves of the tree. 

The current version, Anton v. 1.5, picks an arbitrary tree from a constrained set 
of possibilities and imposes it on the output, so all parts have the same rhythm. In 
future versions, melodic and harmonic changes will interact with the decision over 
which tree is chosen. The newest version, which is not yet released, extends this by 
adding rules to deal with beats, their strength, duration and the interplay between 
parts. These rules allow the parts to have independent, but related rhythms. 

Figure |9] contains a small code fragment of the current rhythm section of the 
system. The fragments show a portion of the structure of the partition tree and its 
three layers of measures, beats and notes. Each of these layers have separate rules 
that govern them. The fragment also shows the mapping from notes to time in- 
stances. We have included the definition of beat strength and one of the constraints 
linking tone and speed. 

While not shown in this code fragment, the rhythm code is, like the other sections, 
written in such a way that diagnosis is possible. 

In order to include rhythm in a composition, it suffices the include option — rhythm 
to the programBuilder .pi script. The command: 

. /programBuilder .pi — task=compose — mode=lydian — time=12 — style=duet 
— rhythm > rhythm-comp 

composes a duet in Lydian with 12 notes per part. 
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Fig. 10. The partitioning tree and human readable format for the Lydian duet 
composition. 

The program can then be parsed using parse . Ip to generate Csound, Lilypond 
and human readable output. Figure [TO] shows the partitioning tree and the human 
score for the Lydian duet. 

6 Evaluation 

In this section we evaluate the performance of Anton both from a computational 
and a musical perspective. We mainly focus on the non-rhythm part of the released 
version 1.5 for stability and reproducibility. 



6.1 Run-time Results 

To evaluate the practicality of using answer set programming in a composition 
system we timed Anton v. 1.5 without the rhythm version while composing a suite 
of score with increasing difficulty. 

Tables 3]|6 contains the timings for a number of answer set solvers (clasp ( Gebser 
et al. 2007|, cmodels (ILierler and Maratea 20041), smodels (ISyrjanen and Niemela 



2001), SMODELS-IE (Brain et al. 20071, SMODELSCC (Ward and Schlipf 2004), and 



SUP (Lierler 2008)) in composing solos, duets, trios and quartets of a given length. 
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Solvers 



Length Clasp 1.2.1 Cmodels 3.79 Smodels 2.33 Smodcls-IE 1.0.0 Smodcls.cc l.OS Sup 0.4 



4 0.02 0.09 0.04 0.02 0.10 0.04 

8 0.18 0.38 1.27 0.52 4.55 0.16 

12 0.48 1.10 8.99 2.87 27.45 0.64 

16 1.05 2.06 36.03 10.19 86.56 1.01 

20 2.68 3.11 32.52 10.02 93.61 2.06 

24 2.42 4.22 193.40 58.06 Time out 2.11 

28 3.84 5.80 239.49 80.56 Time out 4.02 

32 3.90 7.11 305.05 102.91 Time out 4.66 



Table 3. Time taken (in seconds) for a number of solvers generating a solo piece. 



Solvers 



Length Clasp 1.2.1 Cmodels 3.79 Smodels 2.33 Smodels-IE 1.0.0 Smodels.cc 1.08 Sup 0.4 



4 0.14 0.28 0.23 0.10 0.64 0.14 

8 0.43 0.98 10.60 4.92 77.07 0.57 

12 2.28 2.54 Time out Time out Time out 2.18 

16 1.91 3.92 Time out Time out Time out 3.33 

20 3.12 6.58 Time out Time out Time out 7.86 

24 8.60 8.71 Time out Time out Time out 13.55 

28 14.94 19.29 Time out Time out Time out 31.05 

32 13.56 26.90 Time out Time out Time out 31.63 



Table 4. Time taken (in seconds) for a number of solvers generating a duet piece. 



The experiments were run using a 2.4Ghz AMD Athlon X2 4600+ processor, 
running a 64 bit version of OpenSuSE 11.1. All solvers were built in 32 bit mode. 
Each test was limited to 10 minutes of CPU time and 2Gb of RAM. Programs were 
ground using GrinGo 2.0.3 and grounding times of between 0.5 and 7 seconds 
were excluded. They were not reported for two distinct reasons: first, the grounder 
times are constants as we used the same grounder for each solver and second, 
grounding is set-up time. Just like in a life concert all the equipment has already 
been set before the concert, so should grounding be considered separately from the 
real-time composition. All solvers were run using default options, except cmodels 
which was set to use the MiNiSAT 2.0 back end as opposed to the default (zchaff). 
The programs used are available from http : //www . cs . bath . ac . uk/~injb/antoii| 

The results show a significant increase in performance from Anton v. 1.0 reported 
in (Boenn et al. 20081. Then we were only able to compose duets up to length 
16, which took 29.63 seconds using the fastest solver clasp. The current system, 
Anton v. 1.5, only takes 1.01 seconds for the same composition using the same 
solver. Furthermore, we can now compose trios and quartets within a reasonable 
time frame. 

While improvements in the underlying solvers are a factor in the steep increase in 
performance, they are not the main contributor. We obtained most of our increases 



Solvers 



Length Clasp 1.2.1 Cmodels 3.79 Smodels 2.33 Smodels-IE 1.0.0 Smodels_cc 1.08 Sup 0.4 



4 0.26 0.30 0.27 0.04 0.30 0.23 

8 0.78 1.52 3.42 1.34 10.39 0.96 

12 2.17 3.20 22.44 S.52 77.80 2.88 

16 2.81 5.35 104.68 39.20 Time out 4.55 

20 6.95 7.63 Time out Time out Time out 9.01 

24 11.90 10.92 Time out Time out Time out 9.75 

28 41.28 12.32 Time out Time out Time out 10.68 

32 52.98 19.33 Time out Time out Time out 47.94 



Table 5. Time taken (in seconds) for a number of solvers generating a trio piece. 
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Solvers 



Length Clasp 1.2.1 Cmodels 3.79 Smodels 2.33 Smodels-IE 1.0.0 Smodels.cc l.OS Sup 0.4 



4 0.43 0.55 0.49 0.08 0.63 0.55 

8 1.14 2.55 7.98 3.28 32.20 2.70 

12 4.24 5.38 35.11 13.07 128.75 5.46 

16 9.59 8.64 336.36 126.21 Time out 16.86 

20 69.37 11.87 Time out Time out Time out 17.44 

24 194.73 20.44 Time out Time out Time out 30.99 

28 246.10 19.32 Time out Time out Time out 79.13 

32 Time out 46.61 Time out Time out Time out 113.30 



Table 6. Time taken (in seconds) for a number of solvers generating a quartet piece. 



by revisiting AnsProlog encoding and finding more compact representations. We 
have compacted the rule set of minor keys which gives some reduction in space and 
run time. We reformatted some of the harmony rules and relaxed them so they 
only apply to neighbouring parts rather than all parts. The removal of redundant 
constraints compacts the program by a surprising amount. The rewriting of the 
repeated notes section produced a massive increase in grounding while the improved 
encoding of highest and lowest note saved us about 150,000 grounded rules on 16 
note duet and about 30% in run time. Ranges over two octaves is now only noted 
at the end of the program, rather than at the point at which it is triggered. While 
this is slightly less informative, it offers a more than significant speed-up. 

These results show that the system, using the more powerful solvers, is not only 
fast enough to be used as a component in an interactive composition tool but, when 
restricting to shorter sequences, could be used for real-time generation of music. 

It is also interesting to note that the only solvers able to generate longer sequences 
using two or more parts all implement clause learning strategies, suggesting that 
the problem is particularly susceptible to this kind of technique. 

We have not included run-time results for the rhythm section. The implementa- 
tion as this feature is still being explored and the results would not be representative. 



6.2 Music Quality 

The other way to evaluate the system is to judge the music it produces. This is a 
less certain process, involving personal values. However we feel that the music is 
acceptable, at least of the quality of some students of composition, and at times 
capable of moments of excitement. Pieces by Anton v. 1.0 have been played to a 
number of musicians, who apart from the rhythmic deficiency we are addressing 
have agreed that it is valid music. The introduction of rhythm is more recent, and 
consequently it has not been subjected to so much scrutiny. There are still some 
refinements that could improve the output but many of the short pieces are clearly 
valid, and musical. In figure [TT] we present a short quartet sequence in the minor 
key, followed by four major key pieces composed using the Anton system; the 
audio and score can be found in the same location as the other works. 
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Music engraving by LilyPond 2.10.29 — www.lilypond.org 



The interested reader can find more examples on the web: 
http : / /dream . cs . bath . ac . uk/ Anton^ 

Fig. 11. Fragments by Anton 



7 The Use of ASP in Anton 
7.1 Why ASP? 

While music appreciation is matter of personal taste, musicologists use sets of rules 
which determine to which style a musical composition belongs or whether a piece 
breaks or expands the common practice of a certain composer or era. These sets 
of rules also govern the composition. So an intuitive and obvious approach to au- 
tomatic composition is to encode these rules and use a rule based algorithm to 
produce valid music compositions. This natural and simple way of encoding things 
is shown in terms of speed of development, roughly 2 man-months, sophistication 
of the results, the amount of code (about 500 lines of code) and flexibility; we 
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can not only easily encode different styles but the same application is usable not 
only for automated composition but also diagnosis and human assisted composi- 
tion. Furthermore, we automatically gain from all improvements in the underlying 
solver. 



7.2 Answer Set Synthesis 

The conventional view of the ASP paradigm is in terms of problem solving. The 
problem is described using AnsProlog in such a way that the answer sets can be 
interpreted as the solutions to the problem. So the program provides the specifica- 
tion for a solution; the constraints that elements in the search space have to satisfy. 
For Anton we use ASP for synthesis rather than problem solving. We are doing 
something subtly, but importantly different; we are using a solver to generate repre- 
sentative objects given a specification. This is more knowledge representation than 
problem solving, since the language is used to provide a computational description 
of the required object. We believe that this approach. Answer Set Synthesis opens 
new of possibilities for the use of AnsProlog as there are lots of applications for 
which we need parametrisable, consistent but not necessarily exceptional content. 
Take for example, the automatic generation of a virtual world: one specifies the 
components of the environment and their dependencies and the program will gen- 
erate you the possible locations. Generation of test data and puzzles are just a few 
more examples. 



7.3 ASP Methodology 

In constructing Anton a number of advantages of using answer set programming 
have become clear, as have a number of limitations. 

Firstly, AnsProlog programs are very fast to write and very compact. As well 
as the obvious benefits, this means it is possible to develop the system at the same 
time as undertaking knowledge capture and to prototype features in the light of the 
advice of domain experts. Part of the reason why it is so fast to use is that rules 
are fully declarative. Programming thus focuses on expressing the concepts that are 
being modelled rather than having to worry about which order to put things in — 
such as which rules should come first, which concepts have higher priority, which 
choices should be made first, etc. This also makes incremental development easy as 
new constraints can be added one at a time, without having to consider how they 
affect the search strategy. 

Being able to add rules incrementally during development turns out to be ex- 



tremely useful (Brain et al. 20091 from a software engineering view point. During 



the development of Anton, we experimented with a number of different develop- 



ment methodologies. As argued in ( ^Cliffe et al. 2008 1, "visualisation" of answer sets 



is a very productive way bridging the gap between the problem domain and the pro- 
gram. For the musical application domain, the most effective approach was found 
to be first writing a script that translates answer sets to human readable score or 
output for a synthesiser. Next the choice rules were added to the program to create 



Music Composition using ASP 



27 



all possible pieces, valid or not. Finally the constraints were incrementally added to 
restrict the output to only valid sequences. By building up a library of valid pieces 
it was possible to perform regression testing at each step and thus isolate bugs as 
soon as they were introduced. 

Using answer set programming was not without issue. One persistent problem 
was the lack of mature development support tools, particularly debugging tools. 



SPOCK (Brain et al. 20071 was used but as its focus is on computing the reasons 
behind the error, rather than the interface issues of explaining these reasons to the 
user, it was normally quicker to find bugs by looking at the last changes made and 
which regression tests failed. Generally, the bugs that where encountered were due 
to subtle mismatches between the intended meaning of a rule and the declarative 
reading of the rule used. For example the predicate stepUp(P,T) is used to repre- 
sent the proposition "At time T, part P steps up to give the note at time T+l", 
however, it could easily be misinterpreted as "At time T-1, part P steps up to give 
the note at time T". Which of these is used is not important, as long as the same 
declarative reading is used consistently for all rules. With the first "meaning" se- 
lected for Anton, the rule: 

chosenNote(l,T,N+S) :- chosenNote (P,T-1 ,N) , stepUp(P,T), 

stepBy(P,T,S) . 

would not encode the intended progression of notes. Even for experienced pro- 
grammers, maintaining consistency in the semantics of predicates can be hard. One 
possible way of supporting a programmer in avoiding these subtle errors would be to 
develop a system that translated rules into natural language, given the declarative 
reading of the propositions involved. It should then be relatively straightforward to 
check that the rule encoded what was intended. 



8 Conclusions and Future Work 

We have presented an algorithmic composition system that uses AnsProlog to 
describe rules captured from classical texts on first species counterpoint. The early 



system of Boenn et al. (2008 ) has been developed in two significant ways: greatly 
improved performance and the introduction of a coherent rhythm schema. However 
Anton still has considerable scope for further development and enhancement. 

The encoding of rhythm is still an area of active research. Although we now 
have a basic encoding of rhythm it still requires a lot of fine tuning. The use of 
Farey sequences and partitioning trees is the core of this. In Section [5] we already 
indicated that we intend to encode Sima Arom's principles which he discovered 
when studying African Polyrhythms. The use of partitioning trees will need to be 
further developed to do so. 

We have concentrated on first species counterpoint from an early time, but many 
of the rules apply to other styles. We have made a few experiments with rules for 
Bach chorales and for hymn tunes. We need to partition the current rule-sets into 
building blocks to facilitate reuse. 

The current system can write short melodies effectively and efficiently. Devel- 
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opment work is still needed to take this to entire pieces. We can start from these 
melodic fragments but a longer piece needs a variety of different harmonisations 
for the same melody, and related melodies with the same harmonic structure and 
a number of similar techniques. We have not solved the difficult global structure 
problem but it does create a starting point on which we can build a system that is 
hierarchical over time scales; we have a mechanism for building syntactically correct 
sentences, but these need to be built into paragraph and chapters, as it were. It is 
not clear if this will be achieved within the current ASP system, or by a procedural 
layer built on top if it, or some other scheme. 

To make the system more user-friendly, there is a need for a user interface, prob- 
ably graphical, to select from the options and styles. We have avoided this so far, 
as the Musician-Machine Interface is a separate specialist area, but there are plans 
for such an interface to be designed in the next phase. 

In real life pieces some of the rules are sometimes broken. This could be simulated 
by one of a number of extensions to answer set semantics (preferences, consistency 
restoring rules, defensible rules, etc.). However how to systematize the knowledge 
of when it is acceptable to break the rules and in which contexts it is 'better' to 
break them is an open problem. 

For Anton we used AnsProlog as a computational description language rather 
than just a knowledge representation one. There is a tempting possibility to apply 
the same methodology and approach to other areas of content, such as maybe game 
map generation. Initial experiments show promise. 
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